Briefly Noted

نویسندگان

  • Marc Domenig
  • Pius ten Hacken
چکیده

This book describes Word Manager (WM), a lexical database system intended to serve a wide range of NLP applications. The most striking aspect of WM itself, and of this presentation, is its windowed user interface. The book consists of five chapters ('Introduction,' 'The Word Manager Approach,' 'Inflection,' 'Wordformation' [sic], and 'Linguistic Theories') and a lengthy appendix giving syntax definitions for the user language. A WM linguistic description consists of a tree-shaped hierarchy whose nodes contain declarations, feature specifications, and rules and entries of various kinds. Each node is identified by a feature set and associated with a window, the label of the node being inherited by those below it in the tree. Affixation in its simplest form is indicated by juxtaposing feature specifications that identify the segments concerned. More complex cases requiring spelling adjustments at segment boundaries are treated by means of 'match and map' rules. These are regular expression pattern matchers with bindings and substitutions, and may be paired with feature sets that govern their application. WM spelling rules differ from the more familiar two-level variety in being ordered. So, for example, the plural form amici of the Italian amico 'friend' can be produced by adding h to the stem (as would be done for the normal case baco, bachi 'silkworm') and then removing it again. The authors adopt without comment extrinsic rule ordering and radical non-monotonicity when elsewhere the tendency, shared by many theoretical linguists, has for some time been to abandon them. One reason for choosing a hierarchical organization is that redundancy can be minimized through the use of inheritance. However, several aspects of WM give rise to unnecessarily redundant specifications. Patterns in spelling rules must apparently subsume the entire string against which they match; this leads to the presence of repeated subexpressions whose only function is to skip irrelevant characters. What has been missed here is the fact that the phenomena these rules are intended to handle are essentially boundary effects. The pattern-matching component of the rules shows no sign of having been designed for, or even adapted to, the purpose for which it is employed. Similarly, it seems necessary to specify for each word segment mentioned in a rule not only its lexical form but also all of its surface forms, in addition to supplying spelling rules that implicitly express the same correspondence. The intention is to permit cross-checking during compilation, but the tracing facilities offered by WM should make this unnecessary. There is a general impression of piecemeal design, almost as if the shell of WM had been developed without regard to linguistic considerations and then fleshed out with rules and entries at the last moment when it was too late to change anything. But the real weak point of this book lies less in WM itself than in the presentation. Typically, this proceeds by describing some morphological phenomenon, and comparing two or three possible analyses. The emphasis is entirely on examples and the syntax of the system; nowhere do we find a clear statement of how the syntax is to be interpreted, an account of the formal properties of the various mechanisms employed, or proper motivation for the choice of these mechanisms rather than others. There are some interesting ideas hidden below the surface (morphological rules can be specialized to handle exceptions, feature values are used to encode paths through the hierarchy), but it is hard to evaluate them in this form. The first chapter sets out to provide justification for WM and draws comparisons with other approaches to morphology and lexical organization. The authors' awareness of such work seems quite rudimentary: the three contrasted approaches are finite-state lexicons with two-level rules (Koskenniemi 1983), DATR (Evans and Gazdar 1990), and the Celex databases. No mention is made of, for example, the lexical knowledge base created for the Acquilex project (Copestake [1992] is a recent report, but the project has been well publicized for much longer), or the large amount of work that has been done by Bear (1988), Trost (1991), and oth-

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Simulating Societies using Distributed AI

This paper discusses the prospects for using Distributed AI techniques to support the computer simulation of societies. Newly developed ideas and techniques are reviewed, some relevant projects are briefly described, and some potential pitfalls are noted.

متن کامل

The Elastic-plastic Mechanics of Crack Extension

This paper briefly reviews progres~ in the elastic plastic analysis of crack extension. Analytical results for plane strain and plane stress deformation fields are noted, and elastic-plastic fracture instability as well as transitional behavior and combined rate and thermal effects are discussed.

متن کامل

Mallory's ('alcoholic') hyaline in primary biliary cirrhosis.

Mallory's (;alcoholic') hyaline has been found in hepatocytes in 18 of 70 patients with primary biliary cirrhosis. These inclusions have previously been noted in only three cases of primary biliary cirrhosis. Current views on the nature of Mallory's hyaline are briefly discussed.

متن کامل

How Far Can You Trust A Computer?1

The history of attempts to secure computer systems against threats to confidentiality, integrity, and availability of data is briefly surveyed, and the danger of repeating a portion of that history is noted. Areas needing research attention are highlighted, and a new approach to developing certified systems is described.

متن کامل

Equivalence relations and behavior: an introductory tutorial.

With an emphasis on procedural fundamentals, the original behavior-analytic equivalence experiments and the equivalence paradigm are described briefly. A few of the subsequent developments and implications are noted, with special reference to the possible significance of the findings with respect to language and cognition.

متن کامل

Development of compound semiconductor detectors at ESA

Some examples of space-borne applications that require improvements in detector technology compared with conventional Si and Ge designs are described. Properties of compound semiconductors are noted, and a range of different detector developments are briefly reviewed. Material fabrication improvements for several compound semiconductors have resulted in near Fano-limited performance.

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2002